CASC: A Cache-Aware Scheduling Algorithm For Multithreaded Chip Multiprocessors
نویسندگان
چکیده
In this paper we propose CASC, a cache-aware operating system scheduling algorithm for multithreaded chip multiprocessors (CMT). CMT is emerging as a popular architecture for server platforms, and most major hardware manufacturers plan or already have released CMT processors. It is the job of the operating system to manage the shared resources of the hardware, and the second-level (L2) cache is a critical shared resource in CMT processors. We propose an operating system scheduling algorithm, CASC, that improves management of the L2 cache. CASC works by co-scheduling threads that collectively achieve a low L2 miss rate and by giving priority to threads that do not require much space in the L2 cache. In this paper we describe the design of CASC and its implementation in Solaris 10. We show that CASC reduces L2 miss rates by 15-46% and achieves improvement in processor throughput of 28-50%.
منابع مشابه
A Systematic Measurement of the Influence of Non-Uniform Cache Sharing on the Performance of Modern Multithreaded Programs
Most modern Chip Multiprocessors (CMP) feature shared cache on chip, whose influence on the performance of multithreaded programs, unfortunately, remains unclear due to the limited coverage of the deciding factors in prior studies. In this work, we conduct a systematic measurement of the influence using a recently released CMP benchmark suite, PARSEC, with a spectrum of factors considered. The ...
متن کاملPerformance of Multithreaded Chip Multiprocessors and Implications for Operating System Design
We investigated how operating system design should be adapted for multithreaded chip multiprocessors (CMT) – a new generation of processors that exploit thread-level parallelism to mask the memory latency in modern workloads. We determined that the L2 cache is a critical shared resource on CMT and that an insufficient amount of L2 cache can undermine the ability to hide memory latency on these ...
متن کاملOperating System Scheduling for Chip Multithreaded Processors
This dissertation addresses operating system thread scheduling for chip multithreaded processors. Chip multithreaded processors are becoming mainstream thanks to their superior performance and power characteristics. Threads running concurrently on a chip multithreaded processor share the processor’s resources. Resource contention, and accordingly performance, depends on characteristics of the c...
متن کاملBrief Announcement: Parallel Depth First vs. Work Stealing Schedulers on CMP Architectures
1. ABSTRACT In chip multiprocessors (CMPs), limiting the number of off-chip cache misses is crucial for good performance. Many multithreaded programs provide opportunities for constructive cache sharing, in which concurrently scheduled threads share a largely overlapping working set. In this brief announcement, we highlight our ongoing study [4] comparing the performance of two schedulers desig...
متن کاملThread Scheduling For Shared Caches ECE 742 Final Project Report
Simultaneous multithreading (SMT) processors and chip multiprocessors (CMP) with shared caches usually require a primary cache increase by a factor proportional to the number of execution contexts to retain the cache performance of the uniprocessor. In this paper we study depth-first task scheduling, which was recently shown to reduce the number of cache misses when a single multithreaded appli...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2005